NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Saliency-Bench: A Comprehensive Benchmark for Evaluating Visual Explanations

https://doi.org/10.1145/3711896.3737414

Zhang, Yifei; Song, James; Gu, Siyi; Jiang, Tianxu; Pan, Bo; Bai, Guangji; Zhao, Liang (August 2025, ACM)

Free, publicly-accessible full text available August 3, 2026
Aligning target-aware molecule diffusion models with exact energy optimization

Gu, Siyi; Xu, Minkai; Powers, Alexander; Nie, Weili; Geffner, Tomas; Kreis, Karsten; Leskovec, Jure; Vahdat, Arash; Ermon, Stefano (December 2024, Advances in neural information processing systems)

Full Text Available
DUE: Dynamic Uncertainty-Aware Explanation Supervision via 3D Imputation

https://doi.org/10.1145/3637528.3671641

Zhao, Qilong; Zhang, Yifei; Zhu, Mengdan; Gu, Siyi; Gao, Yuyang; Yang, Xiaofeng; Zhao, Liang (August 2024, ACM)

Full Text Available
Going Beyond XAI: A Systematic Survey for Explanation-Guided Learning

https://doi.org/10.1145/3644073

Gao, Yuyang; Gu, Siyi; Jiang, Junji; Hong, Sungsoo Ray; Yu, Dazhou; Zhao, Liang (July 2024, ACM Computing Surveys)

As the societal impact of Deep Neural Networks (DNNs) grows, the goals for advancing DNNs become more complex and diverse, ranging from improving a conventional model accuracy metric to infusing advanced human virtues such as fairness, accountability, transparency, and unbiasedness. Recently, techniques in Explainable Artificial Intelligence (XAI) have been attracting considerable attention and have tremendously helped Machine Learning (ML) engineers in understand AI models. However, at the same time, we started to witness the emerging need beyond XAI among AI communities; based on the insights learned from XAI, how can we better empower ML engineers in steering their DNNs so that the model’s reasonableness and performance can be improved as intended? This article provides a timely and extensive literature overview of the field Explanation-Guided Learning (EGL), a domain of techniques that steer the DNNs’ reasoning process by adding regularization, supervision, or intervention on model explanations. In doing so, we first provide a formal definition of EGL and its general learning paradigm. Second, an overview of the key factors for EGL evaluation, as well as summarization and categorization of existing evaluation procedures and metrics for EGL are provided. Finally, the current and potential future application areas and directions of EGL are discussed, and an extensive experimental study is presented aiming at providing comprehensive comparative studies among existing EGL models in various popular application domains, such as Computer Vision and Natural Language Processing domains. Additional resources related to event prediction are included in the article website:https://kugaoyang.github.io/EGL/
more » « less
Full Text Available
Visual Attention Prompted Prediction and Learning

https://doi.org/10.24963/ijcai.2024/610

Zhang, Yifei; Pan, Bo; Gu, Siyi; Bai, Guangji; Qiu, Meikang; Yang, Xiaofeng; Zhao, Liang (August 2024, International Joint Conferences on Artificial Intelligence Organization)

Visual explanation (attention)-guided learning uses not only labels but also explanations to guide the model reasoning process. While visual attention-guided learning has shown promising results, it requires a large number of explanation annotations that are time-consuming to prepare. However, in many real-world situations, it is usually desired to prompt the model with visual attention without model retraining. For example, when doing AI-assisted cancer classification on a medical image, users (e.g., clinicians) can provide the AI model with visual attention prompts on which areas are indispensable and which are precluded. Despite its promising objectives, achieving visual attention-prompted prediction presents several major challenges: 1) How can the visual prompt be effectively integrated into the model's reasoning process? 2) How should the model handle samples that lack visual prompts? 3) What is the impact on the model's performance when a visual prompt is imperfect? This paper introduces a novel framework for visual attention prompted prediction and learning, utilizing visual prompts to steer the model's reasoning process. To improve performance in non-prompted situations and align it with prompted scenarios, we propose a co-training approach for both non-prompted and prompted models, ensuring they share similar parameters and activation. Additionally, for instances where the visual prompt does not encompass the entire input image, we have developed innovative attention prompt refinement methods. These methods interpolate the incomplete prompts while maintaining alignment with the model's explanations. Extensive experiments on four datasets demonstrate the effectiveness of our proposed framework in enhancing predictions for samples both with and without prompt.
more » « less
Full Text Available
MAGI: Multi-Annotated Explanation-Guided Learning

Zhang, Yifei; Gu, Siyi; Gao, Yuyang; Pan, Bo; Yang, Xiaofeng; Zhao, Liang (September 2023, CVF)

Full Text Available
RES: A Robust Framework for Guiding Visual Explanation

https://doi.org/10.1145/3534678.3539419

Gao, Yuyang; Sun, Tong Steven; Bai, Guangji; Gu, Siyi; Hong, Sungsoo Ray; Liang, Zhao (August 2022, Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery & Data Mining)

Full Text Available

Search for: All records